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RAW SEQUENCE LISTING DATE : 08/09/2001 

PATENT APPLICATION: US/09/688,069 TIME: 13:27:54 

Input Set : A:\SeqList.txt 

Output Set: N:\CRF3\08092001\l688069.raw 

3 <110> APPLICANT : Subramaniam, S.; Slater, S.; Karberg, K.; Chen, R. ; Valentin, H. ; 

W ° ng ' s'<120> TITLE OF INVENTION : Nucleic Acid Sequences to Proteins Involved in Tocopherol 

Synthesis 

7 <130> FILE REFERENCE: 16515.054 

9 <140> CURRENT APPLICATION NUMBER: US 09/688,069 — , 

10 <141> CURRENT FILING DATE: 2000-10-14 E |\f f 

12 <160> NUMBER OF SEQ ID NOS : 114 

14 <210> SEQ ID NO: 1 

15 <211> LENGTH: 1182 y ^? 

16 <212> TYPE: DNA 

17 <213> ORGANISM: Arabidopsis sp . 

19 <400> SEQUENCE : 1 ^ cn 

21 atggagtctc tgctctctag ttcttctctt gtttccgctg ctggtgggtt ttgttggaag 60 

22 aagcagaatc taaagctcca ctctttatca gaaatccgag ttctgcgttg tgattcgagt 

23 aaagttgtcg caaaaccgaa gtttaggaac aatcttgtta ggcctgatgg tcaaggatct 

24 tcattgttgt tgtatccaaa acataagtcg agatttcggg ttaatgccac tgcgggtcag 
2 5 cctgaggctt tcgactcgaa tagcaaacag aagtctttta gagactcgtt agatgcgttt 

26 tacaggtttt ctaggcctca tacagttatt ggcacagtgc ttagcatttt atctgtatct 

27 ttcttagcag tagagaaggt ttctgatata tctcctttac ttttcactgg catcttggag 

28 gctgttgttg cagctctcat gatgaacatt tacatagttg ggctaaatca gttgtctgat 

2 9 gttgaaatag ataaggttaa caagccctat cttccattgg catcaggaga atattctgtt 

30 aacaccggca ttgcaatagt agcttccttc tccatcatga gtttctggct tgggtggatt 

31 gttggttcat ggccattgtt ctgggctctt tttgtgagtt tcatgctcgg tactgcatac 

32 tctatcaatt tgccactttt acggtggaaa agatttgcat tggttgcagc aatgtgtatc 

3 3 ctcgctgtcc gagctattat tgttcaaatc gccttttatc tacatattca gacacatgtg 
34 tttggaagac caatcttgtt cactaggcct cttattttcg ccactgcgtt tatgagcttt 
3 5 ttctctgtcg ttattgcatt gtttaaggat atacctgata tcgaagggga taagatattc 
3 6 ggaatccgat cattctctgt aactctgggt cagaaacggg tgttttggac atgtgttaca 
37 ctacttcaaa tggcttacgc tgttgcaatt ctagttggag ccacatctcc attcatatgg 
3 8 agcaaagtca tctcggttgt gggtcatgtt atactcgcaa caactttgtg ggctcgagct 

39 aagtccgttg atctgagtag caaaaccgaa ataacttcat gttatatgtt catatggaag 

40 ctcttttatg cagagtactt gctgttacct tttttgaagt ga 

43 <210> SEQ ID NO: 2 

44 <211> LENGTH: 393 

45 <212> TYPE: PRT 

46 <213> ORGANISM: Arabidopsis sp . 
48 <400> SEQUENCE: 2 

50 Met Glu Ser Leu Leu Ser Ser Ser Ser Leu Val Ser Ala Ala Gly Gly 

51 1 5 10 15 
53 Phe Cys Trp Lys Lys Gin Asn Leu Lys Leu His Ser Leu Ser Glu He 

25 30 
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240 
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360 

420 

480 

540 

600 
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720 

780 

840 

900 

960 
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1080 

1140 

1182 



54 20 



56 Arg Val Leu Arg Cys Asp Ser Ser Lys Val Val Ala Lys Pro Lys Phe 

57 35 40 45 

59 Arg Asn Asn Leu Val Arg Pro Asp Gly Gin Gly Ser Ser Leu Leu Leu 

60 50 55 60 

62 Tyr Pro Lys His Lys Ser Arg Phe Arg Val Asn Ala Thr Ala Gly Gin 

63 65 70 75 80 
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65 Pro Glu Ala Phe Asp Ser Asn Ser Lys Gin Lys Ser Phe Arg Asp Ser 
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RAW SEQUENCE LISTING DATE : 08/09/2001 

PATENT APPLICATION: US/09/688,069 TIME: 13:27:54 

Input Set : A:\SeqList.txt 

Output Set: N:\CRF3\08092001\I688069.raw 

66 85 90 95 

68 Leu Asp Ala Phe Tyr Arg Phe Ser Arg Pro His Thr Val lie Gly Thr 

69 100 105 HO 

71 val Leu Ser He Leu Ser Val Ser Phe Leu Ala Val Glu Lys Val Ser 

72 115 120 125 

74 Asp He Ser Pro Leu Leu Phe Thr Gly He Leu Glu Ala Val Val Ala 

76 130 135 I 40 

77 Ala Leu Met Met Asn He Tyr He Val Gly Leu Asn Gin Leu Ser Asp 

79 145 150 155 i 

80 Val Glu He Asp Lys Val Asn Lys Pro Tyr Leu Pro Leu Ala Ser Gly 

82 165 170 1 ?5 

83 Glu Tyr Ser Val Asn Thr Gly He Ala He Val Ala Ser Phe Ser He 

84 180 185 190 

86 Met Ser Phe Trp Leu Gly Trp He Val Gly Ser Trp Pro Leu Phe Tr P 

87 195 200 205 

89 Ala Leu Phe Val Ser Phe Met Leu Gly Thr Ala Tyr Ser He Asn Leu 

90 210 215 220 

92 Pro Leu Leu Arg Trp Lys Arg Phe Ala Leu Val Ala Ala Met Cys lie 
5 230 235 -i4U 

95 Leu Ala Val Arg Ala He He Val Gin He Ala Phe Tyr Leu His He 

96 245 250 255 

98 Gin Thr His Val Phe Gly Arg Pro He Leu Phe Thr Arg Pro Leu He 

99 260 265 270 

101 Phe Ala Thr Ala Phe Met Ser Phe Phe Ser Val Val He Ala Leu Phe 

102 275 280 285 

104 Lys Asp He Pro Asp He Glu Gly Asp Lys He Phe Gly He Arg Ser 

105 290 295 300 

107 Phe Ser Val Thr Leu Gly Gin Lys Arg Val Phe Tr P Thr Cys Val Thr 



108 305 310 



315 320 



110 Leu Leu Gin Met Ala Tyr Ala Val Ala lie Leu Val Gly Ala Thr Ser 

111 325 330 335 

113 Pro Phe He Trp Ser Lys Val He Ser Val Val Gly His Val He Leu 

114 340 345 350 

116 Ala Thr Thr Leu Trp Ala Arg Ala Lys Ser Val Asp Leu Ser Ser Lys 

117 355 360 365 

119 Thr Glu He Thr Ser Cys Tyr Met Phe He Trp Lys Leu Phe Tyr Ala 

120 370 375 380 

122 Glu Tyr Leu Leu Leu Pro Phe Leu Lys 

123 385 390 

126 <210> SEQ ID NO: 3 

127 <211> LENGTH: 1224 

128 <212> TYPE: DNA 

129 <213> ORGANISM: Arabidopsis sp . 

131 <400> SEQUENCE: 3 t . , cn 

133 atggcgtttt ttgggctctc ccgtgtttca agacggttgt tgaaatcttc cgtctccgta 60 

134 actLatctt cttcctctgc tcttttgcaa tcacaacata aatccttgtc eaatcctgtg 120 

135 actacccatt acacaaatcc tttcactaag tgttatcctt catggaatga taattaccaa 180 

136 gtatggagta aaggaagaga attgcatcag gagaagtttt ttggtgttgg ttggaattac 

137 agattaattt gtggaatgtc gtcgtcttct tcggttttgg agggaaagcc gaagaaagat 



240 
300 
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Input Set : A:\SeqList.txt 

Output Set: N:\CRF3\08092001\I688069.raw 

138 gataaggaga agagtgatgg tgttgttgtt aagaaagctt cttggataga tttgtattta 360 

139 ccagaagaag ttagaggtta tgctaagctt gctcgattgg ataaacccat tggaacttgg 420 

140 ttgcttgcgt ggccttgtat gtggtcgatt gcgttggctg ctgatcctgg aagccttcca 480 

141 agttttaaat atatggcttt atttggttgc ggagcattac ttcttagagg tgctggttgt 54 0 

142 actataaatg atctgcttga tcaggacata gatacaaagg ttgatcgtac aaaactaaga 600 

143 cctatcgcca gtggtctttt gacaccattt caagggattg gatttctcgg gctgcagttg 660 

144 cttttaggct tagggattct tctccaactt aacaattaca gccgtgtttt aggggcttca 720 
14 5 tctttgttac ttgtcttttc ctacccactt atgaagaggt ttacattttg gcctcaagcc 780 
14 6 tttttaggtt tgaccataaa ctggggagca ttgttaggat ggactgcagt taaaggaagc 840 
14 7 atagcaccat ctattgtact ccctctctat ctctccggag tctgctggac ccttgtttat 900 
14 8 gatactattt atgcacatca ggacaaagaa gatgatgtaa aagttggtgt taagtcaaca 960 

149 gcccttagat tcggtgataa tacaaagctt tggttaactg gatttggcac agcatccata 1020 

150 ggttttcttg cactttctgg attcagtgca gatctcgggt ggcaatatta cgcatcactg 1080 

151 gccgctgcat caggacagtt aggatggcaa atagggacag ctgacttatc atctggtgct 1140 

152 gactgcagta gaaaatttgt gtcgaacaag tggtttggtg ctattatatt tagtggagtt 1200 

153 gtacttggaa gaagttttca ataa 1224 

156 <210> SEQ ID NO: 4 

157 <211> LENGTH: 407 

158 <212> TYPE: PRT 

159 <213> ORGANISM: Arabidopsis sp . 
161 <400> SEQUENCE: 4 

16 3 Met Ala Phe Phe Gly Leu Ser Arg Val Ser Arg Arg Leu Leu Lys Ser 
164 15 10 15 

166 Ser Val Ser Val Thr Pro Ser Ser Ser Ser Ala Leu Leu Gin Ser Gin 

167 20 25 30 

16 9 His Lys Ser Leu Ser Asn Pro Val Thr Thr His Tyr Thr Asn Pro Phe 
170 35 40 45 

172 Thr Lys Cys Tyr Pro Ser Trp Asn Asp Asn Tyr Gin Val Trp Ser Lys 

173 50 55 60 

175 Gly Arg Glu Leu His Gin Glu Lys Phe Phe Gly Val Gly Trp Asn Tyr 

176 65 70 75 80 

178 Arg Leu lie Cys Gly Met Ser Ser Ser Ser Ser Val Leu Glu Gly Lys 

179 85 90 95 

181 Pro Lys Lys Asp Asp Lys Glu Lys Ser Asp Gly Val Val Val Lys Lys 

182 100 105 110 

184 Ala Ser Trp lie Asp Leu Tyr Leu Pro Glu Glu Val Arg Gly Tyr Ala 

185 115 120 125 

187 Lys Leu Ala Arg Leu Asp Lys Pro lie Gly Thr Trp Leu Leu Ala Trp 

188 130 135 140 

190 Pro Cys Met Trp Ser lie Ala Leu Ala Ala Asp Pro Gly Ser Leu Pro 

191 145 150 155 160 

193 Ser Phe Lys Tyr Met Ala Leu Phe Gly Cys Gly Ala Leu Leu Leu Arg 

194 165 170 175 

196 Gly Ala Gly Cys Thr He Asn Asp Leu Leu Asp Gin Asp He Asp Thr 

197 180 185 190 

199 Lys Val Asp Arg Thr Lys Leu Arg Pro He Ala Ser Gly Leu Leu Thr 

200 195 200 205 

202 Pro Phe Gin Gly He Gly Phe Leu Gly Leu Gin Leu Leu Leu Gly Leu 

203 210 215 220 
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205 Gly lie Leu Leu Gin Leu Asn Asn Tyr Ser Arg Val Leu Gly Ala Ser 

206 225 230 235 240 

208 Ser Leu Leu Leu Val Phe Ser Tyr Pro Leu Met Lys Arg Phe Thr Phe 

209 245 250 255 

211 Trp Pro Gin Ala Phe Leu Gly Leu Thr He Asn Trp Gly Ala Leu Leu 

212 260 265 270 

214 Gly Trp Thr Ala Val Lys "Gly Ser He Ala Pro Ser He Val Leu Pro 

215 275 280 285 

217 Leu Tyr Leu Ser Gly Val Cys Trp Thr Leu Val Tyr Asp Thr He Tyr 

218 290 295 300 

220 Ala His Gin Asp Lys Glu Asp Asp Val Lys Val Gly Val Lys Ser Thr 

221 305 310 315 320 

223 Ala Leu Arg Phe Gly Asp Asn Thr Lys Leu Trp Leu Thr Gly Phe Gly 

224 325 330 335 

226 Thr Ala Ser He Gly Phe Leu Ala Leu Ser Gly Phe Ser Ala Asp Leu 

227 340 345 350 

229 Gly Trp Gin Tyr Tyr Ala Ser Leu Ala Ala Ala Ser Gly Gin Leu Gly 

230 355 360 365 

232 Trp Gin He Gly Thr Ala Asp Leu Ser Ser Gly Ala Asp Cys Ser Arq 

233 370 375 380 

235 Lys Phe Val Ser Asn Lys Trp Phe Gly Ala He He Phe Ser Gly Val 

236 385 390 395 400 
2 38 Val Leu Gly Arg Ser Phe Gin 
239 405 

242 <210> SEQ ID NO : 5 

243 <211> LENGTH: 1296 

244 <212> TYPE: DNA 

245 <213> ORGANISM: Arabidopsis sp. 
247 <400> SEQUENCE: 5 

249 atgtggcgaa gatctgttgt ttctcgttta tcttcaagaa tctctgtttc ttcttcgtta D u 
2 50 ccaaacccta gactgattcc ttggtcccgc gaattatgtg ccgttaatag cttctcccag 120 

251 cctccggtct cgacggaatc aactgctaag ttagggatca ctggtgttag atctgatgcc 

252 aatcgagttt ttgccactgc tactgccgcc gctacagcta cagctaccac cggtgagatt 

253 tcgtctagag ttgcggcttt ggctggatta gggcatcact acgctcgttg ttattgggag 

254 ctttctaaag ctaaacttag tatgcttgtg gttgcaactt ctggaactgg gtatattctg 

255 ggtacgggaa atgctgcaat tagcttcccg gggctttgtt acacatgtgc aggaaccatg 420 

256 atgattgctg catctgctaa ttccttgaat cagatttttg agataagcaa tgattctaag 480 

257 atgaaaagaa cgatgctaag gccattgcct tcaggacgta ttagtgttcc acacgctgtt 

258 gcatgggcta ctattgctgg tgcttctggt gcttgtttgt tggccagcaa gactaatatg 

259 ttggctgctg gacttgcatc tgccaatctt gtactttatg cgtttgttta tactccgttg 

260 aagcaacttc accctatcaa tacatgggtt ggcgctgttg ttggtgctat cccacccttg 

261 cttgggtggg cggcagcgtc tggtcagatt tcatacaatt cgatgattct tccagctgct 780 

262 ctttactttt ggcagatacc tcattttatg gcccttgcac atctctgccg caatgattat 840 

263 gcagctggag gttacaagat gttgtcactc tttgatccgt cagggaagag aatagcagca 900 

264 gtggctctaa ggaactgctt ttacatgatc cctctcggtt tcatcgccta tgactggggg 960 

265 ttaacctcaa gttggttttg cctcgaatca acacttctca cactagcaat cgctgcaaca 1020 

266 gcattttcat tctaccgaga ccggaccatg cataaagcaa ggaaaatgtt ccatgccagt 1080 

267 cttctcttcc ttcctgtttt catgtctggt cttcttctac accgtgtctc taatgataat 114 0 

268 cagcaacaac tcgtagaaga agccggatta acaaattctg tatctggtga agtcaaaact 1200 

Pfease Nolo; 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 

r C ?J I'" 9 t0 GnSUre lhat 3 corres P ond '"n9 explanation is presented in the <220> to 
**£3> fields of each sequence which presents at least one n or Xaa. 
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RAW SEQUENCE LISTING DATE: 08/09/2001 

PATENT APPLICATION : US/09/688, 069 TIME : 13:27:54 



Input Set : A:\SeqList.txt 

Output Set: N:\CRF3\08092001\l688069.raw 

269 cagaggcgaa agaaacgtgt ggctcaacct ccggtggctt atgcctctgc tgcaccgttt 1260 

270 cctttcctcc cagctccttc cttctactct ccatga 1296 

273 <210> SEQ ID NO : 6 

274 <211> LENGTH: 431 

275 <212> TYPE: PRT 

276 <213> ORGANISM: Arabidopsis sp. 
278 <400> SEQUENCE: 6 

280 Met Trp Arg Arg Ser Val Val Tyr Arg Phe Ser Ser Arg He Ser Val 

281 15 10 15 

283 Ser Ser Ser Leu Pro Asn Pro Arg Leu He Pro Trp Ser Arg Glu Leu 

284 20 25 30 

286 Cys Ala Val Asn Ser Phe Ser Gin Pro Pro Val Ser Thr Glu Ser Thr 

287 35 40 45 

2 89 Ala Lys Leu Gly lie Thr Gly Val Arg Ser Asp Ala Asn Arg Val Phe 
290 50 55 60 

292 Ala Thr Ala Thr Ala Ala Ala Thr Ala Thr Ala Thr Thr Gly Glu He 

293 65 70 75 80 

295 Ser Ser Arg Val Ala Ala Leu Ala Gly Leu Gly His His Tyr Ala Arg 

296 85 90 95 

298 Cys Tyr Trp Glu Leu Ser Lys Ala Lys Leu Ser Met Leu Val Val Ala 

299 100 105 110 

301 Thr Ser Gly Thr Gly Tyr He Leu Gly Thr Gly Asn Ala Ala He Ser 

302 115 120 125 

304 Phe Pro Gly Leu Cys Tyr Thr Cys Ala Gly Thr Met Met He Ala Ala 

305 130 135 140 

307 Ser Ala Asn Ser Leu Asn Gin He Phe Glu He Ser Asn Asp Ser Lys 

308 145 150 155 160 
310 Met Lys Arg Thr Met Leu Arg Pro Leu Pro Ser Gly Arg He Ser Val 
3H 165 170 175 

313 Pro His Ala val Ala Trp Ala Thr He Ala Gly Ala Ser Gly Ala Cys 

314 180 185 190 

316 Leu Leu Ala Ser Lys Thr Asn Met Leu Ala Ala Gly Leu Ala Ser Ala 

317 195 200 205 

319 Asn Leu Val Leu Tyr Ala Phe Val Tyr Thr Pro Leu Lys Gin Leu His 

320 210 215 220 

322 Pro He Asn Thr Trp Val Gly Ala Val Val Gly Ala He Pro Pro Leu 

323 225 230 235 240 

325 Leu Gly Trp Ala Ala Ala Ser Gly Gin He Ser Tyr Asn Ser Met He 

326 245 250 255 

328 Leu Pro Ala Ala Leu Tyr Phe Trp Gin He Pro His Phe Met Ala Leu 

329 260 265 270 

331 Ala His Leu Cys Arg Asn Asp Tyr Ala Ala Gly Gly Tyr Lys Met Leu 

332 275 280 285 

334 Ser Leu Phe Asp Pro Ser Gly Lys Arg He Ala Ala Val Ala Leu Arg 

335 290 295 300 

3 37 Asn Cys Phe Tyr Met He Pro Leu Gly Phe He Ala Tyr Asp Trp Gly 
338 305 310 315 320 

34 0 Leu Thr Ser Ser Trp Phe Cys Leu Glu Ser Thr Leu Leu Thr Leu Ala 
341 325 330 335 
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VERIFICATION SUMMARY DATE: 08/09/2001 

PATENT APPLICATION: US/09/688, 069 TIME: 13 : 27 : 55 

Input Set : A:\SeqList.txt 

Output Set: N:\CRF3\08092001\l688069.raw 



L 


391 


M 


341 


W 


(46) 


"n" 


or 


L 


392 


M 


341 


W 


(46) 


"n" 


or 


L 


844 


M 


341 


W 


(46) 


"n n 


or 


L 


845 


M 


341 


W 


(46) 


"n" 


or 


L 


846 


M 


341 


W 


(46) 


"n" 


or 


L 


945 


M 


341 


W 


(46) 


"n" 


or 


L 


960 


M 


341 


w 


(46) 


"n" 


or 


L 


979 


M 


341 


w 


(46) 


"n" 


or 


L 


980 


M 


341 


w 


(46) 


"n" 


or 


L 


982 


M: 


341 


w 


(46) 


"n" 


or 



L:2799 M:341 W: (46) "n" or 



"Xaa" used, for SEQ ID# : 8 

"Xaa" used, for SEQ ID# : 8 

"Xaa" used, for SEQ ID#:22 

"Xaa" used, for SEQ ID#:22 

"Xaa" used, for SEQ ID#:22 

"Xaa" used, for SEQ ID#:25 

"Xaa" used, for SEQ ID#:26 

"Xaa" used, for SEQ ID# : 27 

"Xaa" used, for SEQ ID#:27 

"Xaa" used, for SEQ ID# : 27 
"Xaa" used, for SEQ ID#:102 



file://C:\CRF3\Outhold\VsrI688069.htm 



8/9/01 



